Searching for molecules with similar biological activity: analysis by fingerprint profiling.
نویسندگان
چکیده
We have recently developed a mini-fingerprint (MFP) representation for small molecules that performs well in database searches for compounds with similar biological activity. The MFP consists of only 54 bit positions that account for numerical ranges of three two-dimensional (2D) descriptors or the presence or absence of defined structural fragments. Here we present an analysis method, termed fingerprint profiling, to systematically compare bit patterns of compounds belonging to different biological activity classes. Some but not all bit positions were variably occupied in seven different activity classes and responsible for the detection of structure-activity differences. The analysis has made it possible to rank bit positions and encoded molecular descriptors according to their importance for our similarity search calculations. Fingerprint profiling can be applied to any keyed bit string representation and should be helpful, for example, to analyze descriptor distributions in large compound databases.
منابع مشابه
Similarity-based data mining in files of two-dimensional chemical structures using fingerprint measures of molecular resemblance
This paper reviews the use of measures of inter-molecular similarity for processing databases of chemical structures, which play an important role in the discovery of new drugs by the pharmaceutical industry. The similarity measures considered here are based on the use of a fingerprint representation of molecular structure, where a fingerprint is a vector encoding the presence of fragment subst...
متن کاملHigh-Performance Thin-Layer Chromatographic Fingerprints of Flavonoids and Phenol Carboxylic Acids for Standardization of Iranian Species of the Genus Crataegus L.
Eight samples of flowering tops from six species of the genus Crataegus L., commonly called Hawthorn, from different geographic locations of Iran were standardized according to German Pharmacopoeia monograph on Crataegi folium cum flore (hawthorn leaf with flower) by high-performance thin-layer chromatograph-ic (HPTLC) fingerprinting combining with digital scanni...
متن کاملFast small molecule similarity searching with multiple alignment profiles of molecules represented in one-dimension.
Multiple sequence alignment has proven to be a powerful method for creating protein and DNA sequence alignment profiles. These profiles of protein families are useful tools for identifying conserved motifs, such as the catalytic triad of the serine protease family or the seven transmembrane helices of the G-protein coupled receptor family. Ultimately, the understanding of the critical motifs wi...
متن کاملIn silico Screening and Evaluation of the Anticonvulsant Activity of Docosahexaenoic Acid-Like Molecules in Experimental Models of Seizures
Background: Resistance to antiepileptic drugs as well as intolerability in 20-30% of the patients raises demand for developing new drugs with improved efficacy and safety. Acceptable anticonvulsant activity, good tolerability, and inexpensiveness of docosahexaenoic acid (DHA), make it as a good candidate for designing and development of the new anticonvulsant medications. Methods: Ten DHA-based...
متن کاملCombination of fingerprints and MCS-based (inSARa) networks for Structure-Activity-Relationship analysis
Structure-Activity-Relationship (SAR) analysis of small molecules is a fundamental and challenging task in drug discovery. The knowledge of these relationships between chemical structure and bioactivity is of high value for the medicinal chemist, e.g., in the lead-optimization process or de-novo-design. In order to analyse SARs, the recognition of molecular similarities is a crucial step due to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
دوره شماره
صفحات -
تاریخ انتشار 2000